Search results for "Data augmentation"
showing 10 items of 10 documents
Alleviating Class Imbalance Problem in Automatic Sleep Stage Classification
2022
For real-world automatic sleep-stage classification tasks, various existing deep learning-based models are biased toward the majority with a high proportion. Because of the unique sleep structure, most of the current polysomnography (PSG) datasets suffer an inherent class imbalance problem (CIP), in which the number of each sleep stage is severely unequal. In this study, we first define the class imbalance factor (CIF) to describe the level of CIP quantitatively. Afterward, we propose two balancing methods to alleviate this problem from the dataset quantity and the relationship between the class distribution and the applied model, respectively. The first one is to employ the data augmentati…
Improving Irony and Stereotype Spreaders Detection using Data Augmentation and Convolutional Neural Network
2022
In this paper we describe a deep learning model based on a Data Augmentation (DA) layer followed by a Convolutional Neural Network (CNN). The proposed model was developed by our team for the Profiling Irony and Stereotype Spreaders (ISSs) task proposed by the PAN 2022 organizers. As a first step, to classify an author as ISS or not (nISS), we developed a DA layer that expands each sample in the dataset provided. Using this augmented dataset we trained the CNN. Then, to submit our predictions, we apply our DA layer on the samples within the unlabeled test set too. Finally we fed our trained CNN with the augmented test set to generate our final predictions. To develop and test our model we us…
Bayesian models for data missing not at random in health examination surveys
2018
In epidemiological surveys, data missing not at random (MNAR) due to survey nonresponse may potentially lead to a bias in the risk factor estimates. We propose an approach based on Bayesian data augmentation and survival modelling to reduce the nonresponse bias. The approach requires additional information based on follow-up data. We present a case study of smoking prevalence using FINRISK data collected between 1972 and 2007 with a follow-up to the end of 2012 and compare it to other commonly applied missing at random (MAR) imputation approaches. A simulation experiment is carried out to study the validity of the approaches. Our approach appears to reduce the nonresponse bias substantially…
Spatial Bayesian Modeling of Presence-only Data
2011
A visual framework to create photorealistic retinal vessels for diagnosis purposes
2020
The methods developed in recent years for synthesising an ocular fundus can be been divided into two main categories. The first category of methods involves the development of an anatomical model of the eye, where artificial images are generated using appropriate parameters for modelling the vascular networks and fundus. The second type of method has been made possible by the development of deep learning techniques and improvements in the performance of hardware (especially graphics cards equipped with a large number of cores). The methodology proposed here to produce high-resolution synthetic fundus images is intended to be an alternative to the increasingly widespread use of generative ad…
Fast Estimation of Diffusion Tensors under Rician noise by the EM algorithm
2016
Diffusion tensor imaging (DTI) is widely used to characterize, in vivo, the white matter of the central nerve system (CNS). This biological tissue contains much anatomic, structural and orientational information of fibers in human brain. Spectral data from the displacement distribution of water molecules located in the brain tissue are collected by a magnetic resonance scanner and acquired in the Fourier domain. After the Fourier inversion, the noise distribution is Gaussian in both real and imaginary parts and, as a consequence, the recorded magnitude data are corrupted by Rician noise. Statistical estimation of diffusion leads a non-linear regression problem. In this paper, we present a f…
Retinal image synthesis through the least action principle
2020
Eye fundus image analysis is a fundamental approach in medical diagnosis and follow-up ophthalmic diagnostics. Manual annotation by experts needs hard work, thus only a small set of annotated vessel structures is available. Examples such as DRIVE and STARE include small sets for training images of fundus image benchmarks. Moreover, there is no vessel structure annotation for a number of fundus image datasets. Synthetic images have been generated by using appropriate parameters for the modeling of vascular networks or by methods developing deep learning techniques and supported by performance hardware. Our methodology aims to produce high-resolution synthetic fundus images alternative to the…
Prediction of Airport Pavement Moduli by Machine Learning Methodology Using Non-destructive Field Testing Data Augmentation
2022
For the purpose of the Airport Pavement Management System (APMS), in order to optimize the maintenance strategies, it is fundamental monitoring the pavement conditions’ deterioration with time. In this way, the most damaged areas can be detected and intervention can be prioritized. The conventional approach consists in performing non-destructive tests by means of a Heavy Weight Deflectometer (HWD). This equipment allows the measurement of the pavement deflections induced by a defined impact load. This is a quite expensive and time-consuming procedure, therefore, the points to be investigated are usually limited to the center points of a very large mesh grid. Starting from the measured defle…
Data Augmentation Approach in Bayesian Modelling of Presence-only Data
2011
Abstract Ecologists are interested in prediction of potential distribution of species in suitable areas, essential for planning conservation and management strategies. Unfortunately, often the only available information in such studies is the true presence of the species at few locations of the study area and the associated environmental covariates over the entire area, referred as presence-only data. We propose a Bayesian approach to estimate logistic linear regressions adapted to presence-only data through the introduction of a random approximation of the correction factor in the adjusted logistic model that allows us to overcome the need to know a priori the prevalence of the species.
Convolutional Neural Network Based Sleep Stage Classification with Class Imbalance
2022
Accurate sleep stage classification is vital to assess sleep quality and diagnose sleep disorders. Numerous deep learning based models have been designed for accomplishing this labor automatically. However, the class imbalance problem existing in polysomnography (PSG) datasets has been barely investigated in previous studies, which is one of the most challenging obstacles for the real-world sleep staging application. To address this issue, this paper proposes novel methods with signal-driven and image-driven ways of noise addition to balance the imbalanced relationship in the training dataset samples. We evaluate the effectiveness of the proposed methods which are integrated into a convolut…